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1.0 INTRODUCTION 
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The magnetic field experiment on the SIRIO spacecraft uses a 
triaxial set of sensors tc obtain precise measurements of the vector 
magnetic field (Ness et al., 1969) . An early assignment of telemetry 
allocated a total of 96 words of 8 bits each in the telemetry format, 
as shown in Figure 1. 

The sampling rate of the instrument was 4 times per frame, uni- 
formly spaced, so that the Nyquist frequency was 2 times the frame rate 
(of 1 per second) or fjj = 2 Hz. This frequency is twice the nominal 
spin frequency of the satellite but only slightly higher than the 
proton cyclotron frequency, fp = 1.7 Hz, assuming a field of 110 gamma. 
Time variations of the field of up to 5 0$ are expected so that fp will 
range between O .85 and 3*4 Hz. 

In mid-October, discussions of the possible use of more telemetry 
for the magnetic field experiment were held. The net result was that 
the sampling rate could be doubled in frames 0-6, but not in frame 7j 
and of equal importance an increase in the precision for each A/D 
conversion from 8 to 9 bits was also permitted. This substantially 
improves the experiment since the digitization uncertainty is now +85 
gamma/512 = +0.16 gamma while the Nyquist frequency is 4 Hz, well above 
the cyclotron frequency. 

However, since in frame 7 the sampling rate is only one half that 
of the other frames, a question arises about the effects in spectral 
analysis using data which periodically contain missing values. It is the 
purpose of this note to investigate this problem in the specific 
situation of the SIRIO experiment. 
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This note will also outline the general principles for 
analyzing the effects on spectral analysis of uniformly sampled data 
streams which contain missing values. 

2.0 MATHEMATICAL BASIS (Brae ewe 11. 1963) 

It is well-known that a continuous time function, f(t) whose 
spectrum F(f) is band limited from 0 to f^ Hz, is uniquely 
representable for all values of t by uniformly spaced discrete 
samples at a rate f s = f N * 2. 

That is, given f(t) then multiplication by 

+ 06 

E 6(t-kAt) 

K= - 09 

will produce a discrete time series at the sample points t^= kAt 

such that F(f) can be uniquely determined (At = l/% = l/2 %). 

6(t-t ) is an impulse at tM^such that 6 (t-t 0 ) = 0 everywhere except 

at t Q when it = ® and normalized so that. 

00 

J - 6(t-t 0 )dt - 1. 

- 00 

The spectrum F(f) is then replicated in the frequency domain so 
that the discrete time series spectrum is given by 

+ » 

(2.1) E F (f-k/At) 

k= - * 

For the analysis to be performed, use shall be made of the 
following three Fourier Transform theorems : 
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Addition : the spectrum of time series A plus time 

series B is simply the sum of ^he corresponding spectra, 

i.e. 

(2.2) F A+B (f) “ F A (f) + F B (f) 

Shifting : The spectrum of a time series A offset in time 

by an amount At is given by the produce of the spectrum 

and exp (-i2nfAt) i.e. 

(2-3) F A(t+At) (f) = *A(t) (f) * «‘ 12nf4t 

Similarity : the spectrum of a time series A', distorted in 

time linearly by a factor a, is given by the original 
spectrum distorted in frequency linearly by a“^ and in 
amplitude by l/|a| 

(2.4) F A’(f) 

where f^, (t) = f (at) 

3.0 APPLICATIONS 

In our study of the SIRIO uniformly sampled time series with 
periodic data gaps, we will assume the original time series, f(t), 
to be appropriately band limited so that F(f)=0 for f * f n where % 

is computed on the basis of 8 samples per frame, i.e. = 4 Hz. 

We assume that the repetitive time base for the time series is 
8 frames or one format. If data points were available at the 
uniform rate of f_, then the length of this time base would be 

8x8xAt = 64 a t. The real window can be considered to be of equal length 

but with some of the data points missing. 
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The SIRIO sampling window is then given by the following function: 

00 

(3.1) w(t) = 2 W. 6(t-kAt) 

k= - ® K 

where 


(3.2) 

W K = 1 

for 

-28 £ K (modulo 64) £ +28 


W K *1 

for 

K (modulo 64) = +30 


w K = 0 

for 

K (modulo 64) = +29; + 31 


w K - 0.5 

for 

K (modulo 64) = +32 


The origin has been selected so that the sampling 
window is an even function of time and the repetition rate 
is the format rate. (See Figure 2). 

The weights for the end values are one half because the 
window overlaps at these terminal points. 

Spectral analysis of the SIRIO data will yield 

a result which is the convolution of the true spectrum 
F(f) with the spectrum of the window given in (3.1). The 
spectrum of the window in 3.1 is readily obtained by the 
use of the three theorems given in (2.2) to (2.4). 

The principal effect to be noticed from application 
of the theorems is that the spectrum of the window (3.1) 
will contain lines (impulses of variable amplitude) at 
frequencies equal to l/64 of the sampling frequency. We 
shall refer to these as "parasitic lines". That is, the 
spectrum of (3.1) is 

(3.4) W SIRI0 (f) =2 b k 6(f - ^ f s ) 

- oo ^ 
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The values of the b^ are determined by the values of the as 


follows : 

( 3 . 5 ) 


32 

W SIRI0 (f) = 1 + S W K exp (1 2n K f/f„) 
k -0 


Note that if all the =■ 1.0 except for K (aodulo 64 ) =32 when 

%-0 then b^=0 for all K except K( modulo 64 = 0 

when b^ = 1.0. This of course is what would be expected 

since the completely filled data window for SIRIO should be 

invariant to a selection of time base for repetition. 


The actual time window for SIRIO is as given in (3.2) and 
the corresponding spectral window is shown in Figure 3a for 


0 £f £ f jj . 


The effect of the missing sample points is 
to introduce relatively small amplitude lines (< Tto of the 
main line at f=0). These lines will lead to a contamination 
of all spectral estimates since the estimated spectrum from 
the SIRIO data will be related to the true spectrum through 
the convolution of ( 3 . 3 ) with F(f). That is 


(3 - 6) ^estimated “ J\ F true< ? > * W SIRI0< f - 5 > d « 

— 00 


The contaminating lines are seen to be both small and of 
alternating sign. Thus, depending upon the exact properties 
of F(f), i.e. the position of lines and the general shape, 
F est W or may not be a valid approximation to F(f). 
Especially in the interpretation of the spectral results, 
should one be aware of . these contaminating effects. 
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As an example, if the true spectrum of F(f) contained 
only one line of amplitude A at f « a fjj» then the 
estimated spectrum, would be given by 

(3.7) F est (f) = A * W SIRI0 ( i a f^) 

and a whole set of parasitic lines would be computed and 
appear to be present in the estimated spectrum. 

One other note here is that in this treatment we have assumed 
an infinitely long time series so that the spectra are all 
composed of lines. In any real computation we shall be 
faced with use of a finite length record (time interval = T) , 
the effect of which is to broaden each line by convolution 
by the transform (sin (nfT)/TrfT) of the data window 
corresponding to the available time interval. In order to 
avoid overlapping the individual parasitic lines, at a 
spacing of f N , it would be necessary to consider using 
a time series on the order or greater than 32 times the 
sequence length. 

Another important note is that the present discussion has 
dealt exclusively with the amplitude spectra. It is most 
probable that in practice the power spectra will be computed. 
The effect of this is to multiply the true power spectrum 
by the square of the data window's transform. 

Thus the maximum contamination for the raw SIRIO window will 
be (0.065) 2 = 0.42$ of the main peak. At the same time, we 
should be aware that any overall amplitude spectral slope 
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such as f~ a will now become f“ 2< * and the problem of 
contamination is improved only to the extent that the 
alternating signs of the parasitic lines leads to 
cancellation of their individual contribution. This 
cancellation occurs only for the amplitude spectra. 

4.0 IMPROVEMENTS 

It is possible to consider improving the accuracy of SIRIO spectral 
estimates, at least to some degree, by using techniques similar to those 
employed for treating spurious data points which occur at random in 
any real experimental data set. These are due to a variety of causes, 
such as poor transmission, reception, encoding, decoding, etc. In these 
cases the conceptually simple and quantitatively defined method of 
numerical interpolation can be employed. 

In this method the value for a missing data point is obtained from 
a weighted sum of neighboring data points. Classical formulae used 
in the well known techniques of data smoothing (Hildebrand, 1956 ) are 
especially convenient for solution to this problem. 

As a first approximation consider using the zeroth order, 3 point 
formula : 

(4.1) yj. = \ (fj..! + fi + f i+1 ) 

where y^ represents the smoothed output for three data points, 
centered about the ith point. If in fact f^ is not available 
(as would be the case for K (modulo 64) = + 29 , +31 > then 
solving for f^ (assumed = y^) we obtain the estimate 

(4.2) = \ (fi.! + f i+1 ) 

Applying this to the case of SIRIO we have the following 
weights 
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W K - 1 for K (modulo 64) * -27 

Wft * 1+ i for K (modulo 64) * +?8 

W K « 2 for K (modulo 64)“ +30 

« 1 for K (modulo 64) " +32 

Substituting these values in (3.4) yields the corresponding 
data window shown in Figure 3b. Here is is seen that there 
is a substantial reduction in the amplitude of the 
parasitic lines for f < 0.5%, a slight increase for 0.5%< 
f < 0.75% 8,11 increase by almost a factor of two for 

f > 0.8%. 

This window is more attractive, from an analysis viewpoint, 
since contamination greater than 1$ does not occur until 
f/% > 0.6. This means that if the true spectrum tails off 
rapidly with frequency, such as for where a > 1> only the 
values near the folding frequency will possibly be 
contaminated. If the bandpass of the instrument is chosen 
to be less than %, say f LP 0,8 f N> then almost undistorted 
spectral estimates can be obtained. 

This technique of using smoothing formulae to estimate the 
values associated with % where K (modulo 64) = +29, +3l 
can be extended to higher order, which naturally requires 
additional data points . In those instances where the 
estimated value for %, K (modulo 64) - +31> depends upon 
the value of % It (modulo 64) = +29 or viceversa, then repeated 
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or chained use of the formulae is required. Given here 
without further derivation are the modified weights 
(which are non-unity) for the following cases: 


1st degree, 5 points 
See Figure 3c 


W 


W 


w 


w 


+25 

+27 

+28 

+30 


w 


32 


Also 1st degree, 
5 points 
See Figure 3d 


w +22 




W 

+32 


3rd degree, 5 points 
See Figure 3© 


3rd degree, 7 points 
See Figure 3f 


W 


W 


+28 

+30 


W 


+32 


W 


+24 


w 


w 


+25 

+26 


W +27 

W +28 

w +30 


w 


+32 


1/4 

1 + 1/4 

2 + 1/2 (4.4) 

2 

1/2 


i + 8 M 

1 + l6 /49 

i + h0 M 
1 + 1 /l4 

l + 1 /4 
2 1 /2 
3/4 


(4.5) 


(4.6) 


1 + 

529 

1-203 

520 

1-24 

529 

1+13 

529 
1 + 372 


1 + 1°^ 
529 

1 + JL 

2 23 


(4.6) 


The improvements, or lack thereof, can he seen in Figures 3c, 


3d, 3e and 3f respectively 
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5.0 COMMENTS 

As a part of this study, the effect of changing the location of the 
missing data points within the data set was investigated. The purpose 
here was to see whether or not an improvement was realised simply by 
redistributing the sampling times. The intuitively least influential 
distribution might seem to be, at first consideration, that of a uniform 
distribution of missing values. In the case at hand, SIRIO, this 
would be 7 consecutive values followed by one missing data value. We 
would expect this to introduce a partioning of the spectral window 
by eighths and it is precisely what happens, as seen in Figure 3g. 

Here the p rasitic lines, of alternating sign, have equal magnitude = 

6 , 5 $. 

We conclude that the spacing of gaps periodically is not an 
improvement since the cancellation enjoyed by alternating signs occurs 
now over a much wider frequency interval than for the present SIRIO 
distribution. In addition, the amplitudes are all at the same 
relatively large value. This result might have been anticipated if we 
had considered the effects of constructive interference of the missing 
values, with respect to this time distribution. 

The effect of estimating the values in the data gaps by using 
the simple formula in (4.2) yields the data window shown in Figure 3h. 
Again the amplitudes of the parasitic lines at frequencies near 
the main line f=0 are reduced while those near are increased. An 


improvement is made which is similar in spirit to that of cases 3a*, 3b. 
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If the missing points are immediately adjacent to each other 
then a slight alteration of these parasitic lines is effected. 

6,0 FUTURE STUDIES 

It is clear that the use of higher order formula for interpolation 
(and smoothing) of the data may lead to significant improvements 
beyond those already achieved. Possibly these shall be worth the 
computational effort and they definitely should be investigated. 

In addition it might be of interest to construct some artificial 
time series, representative of the anticipated data,* and examine the 
effects of these various windows. 
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9.0 APPENDIX A 

The following computer pointouts represent the results of the 
numerical evaluation of formula (3.5). The weights used are listed 
in the format: 





WORD = 




ui 

2 ^ 
ct o I s - 

cr 


NEW SI RIO FORMAT - Sampling at Words 0,8,16,24,32,40,48^56 

("FOR FRAME 7 OELETE 8.24,40,56) 






1 FORMAT 


k 






Fig. 3 
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